Text-To-Speech Intelligibility Across Speech Rates

نویسندگان

Ann K. Syrdal

H. Timothy Bunnell

Susan R. Hertz

Taniya Mishra

Murray F. Spiegel

Corine Bickley

Deborah Rekart

Matthew J. Makashay

چکیده

A web-based listening test measured intelligibility across speech rate of 8 TTS systems and of a linearly timecompressed human speech reference voice. The synthesis systems included 2 independent representatives of each of the following 4 synthesis methods: formant, diphone concatenation, unit selection concatenation, and HMM. For each TTS system, a female and a male American English voice were tested. Semantically unpredictable sentences were presented at 6 speech rates from 200 to 450 words per minute. In an open response format, listeners typed what they heard. Listener transcriptions were automatically scored at the word level, and a normalized edit distance per speech rate was calculated for each of 355 listeners. There were significant differences among the TTS systems. The 2 unit selection TTS systems were the most intelligible across speech rates; one was equivalent to human speech. Listeners’ native language, TTS familiarity, and audio equipment were also significant factors.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech intelligibility after repair of cleft lip and palate

    Background: Intelligibility refers to understandability of speech; and lack of it can negatively affect children’s overall communication effectiveness. Children with repaired cleft lip and/or cleft palate (CL/P) may experience poor speech intelligibility. This study aimed at evaluating speech intelligibility in children with repaired CL/P who had not been referred to sp...

متن کامل

Speech Intelligibility of Cochlear-Implanted and Normal-Hearing Children

Introduction: Speech intelligibility, the ability to be understood verbally by listeners, is the gold standard for assessing the effectiveness of cochlear implantation. Thus, the goal of this study was to compare the speech intelligibility between normal-hearing and cochlear-implanted children using the Persian intelligibility test. Materials and Methods: Twenty-six cochlear-implanted childre...

متن کامل

Cipher text only attack on speech time scrambling systems using correction of audio spectrogram

Recently permutation multimedia ciphers were broken in a chosen-plaintext scenario. That attack models a very resourceful adversary which may not always be the case. To show insecurity of these ciphers, we present a cipher-text only attack on speech permutation ciphers. We show inherent redundancies of speech can pave the path for a successful cipher-text only attack. To that end, regularities ...

متن کامل

Speech Intelligibility in Persian Children with Down Syndrome

Objectives: One of the most effective methods to describe speech disorders is the measurement of speech intelligibility. The speech intelligibility indicates the extent of acoustic signals that correctly speaker produces and hearer receives. The purpose of this study was to investigate the speech intelligibility in the Persian children with Down syndrome, age range was 3 to 5 years, who had spo...

متن کامل

Text to Speech Synthesis System for Tamil

In a text-to-speech system, spoken utterances are automatically produced from text. In this paper, we present a corpus-driven Tamil text-to-speech (TTS) system based on the concatenative synthesis approach. The most important qualities of a synthesized speech are naturalness and intelligibility. In this system, words and syllables are used as the basic units for synthesis. Our corpus consists o...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2012

Text-To-Speech Intelligibility Across Speech Rates

نویسندگان

چکیده

منابع مشابه

Speech intelligibility after repair of cleft lip and palate

Speech Intelligibility of Cochlear-Implanted and Normal-Hearing Children

Cipher text only attack on speech time scrambling systems using correction of audio spectrogram

Speech Intelligibility in Persian Children with Down Syndrome

Text to Speech Synthesis System for Tamil

عنوان ژورنال:

اشتراک گذاری